Abstract

Some problems of statistics can be reduced to extremal problems of minimizing
functionals of smooth functions defined on the cube $[0,1]^m$, $m \ge 2$. In
this paper, we study a class of extremal problems that is closely connected to 
the problem of testing multivariate independence. By solving the extremal 
problem, we provide a unified approach to establishing weak convergence for 
a wide class of empirical processes which emerge in connection with test- 
ing independence. The use of our result is also illustrated by describing 
the domain of local asymptotic optimality of some nonparametric tests of 
independence. 
1 Introduction

Let $X_i = (X_{i1}, \dots, X_{im})$, $m \ge 2$, $i = 1, \dots, n$, be independent random vectors with
absolutely continuous distribution function (df) $F$ and marginal df's $F_1, \dots, F_m$.
An important problem of statistics is to test the hypothesis of (multivariate) independence

$$H_0 : F = F_1 F_2 \cdots F_m. \tag{1}$$



Many test statistics for testing multivariate independence converge weakly toward
functionals of Gaussian random processes under the hypothesis of independence
(see, e.g., Deheuvels (1981); Nikitin (1995); Nazarov and Nikitin (2000);
Schmid and Schmidt (2007)). If such functionals are regular then the limiting
distributions of test statistics are easily derived. The knowledge of the limiting
distribution is important for calculating (approximate) critical values and finding
asymptotic efficiency of the sequence of test statistics. Unfortunately, many independence
tests have a complex structure, and the corresponding asymptotic theory
is not easy to develop. This prevents the use of such tests. In this paper, by 
solving a multidimensional extremal problem (Theorem 1), we prove a result on 
the weak convergence of empirical processes (Theorem 2), which yields the limit 
distribution for a wide class of test statistics for testing independence. Such a class 
includes, as extreme cases, the multivariate versions of Cramer-von Mises and 
Blum-Kiefer-Rosenblatt test statistics (see Section 4.1). In addition, we illustrate 
the use of Theorem 1 for describing the domain of local asymptotic optimality of 
some nonparametric tests of independence (see Section 4.2). 

Let $M = \{1, 2, \dots, m\}$ and let $2^M$ be the set of all subsets of $M$. In this paper,
we study a class of extremal problems indexed by a subset $\mathcal{M}$ of $2^M$ that obeys
certain restrictions. For $m = 2$ there exist three essentially different choices for
$\mathcal{M}$, and the class of extremal problems is limited. Such problems were studied in
(Nikitin, 1995, Ch. 5) and Nazarov and Nikitin (2000) in connection with calculating
asymptotic efficiency of nonparametric tests of independence in a bivariate
setup. In a general case of $m > 2$ the situation is more complicated, and the number
of possible extremal problems for an arbitrary $m$ is not explicitly calculable
(see Remark 1).

The paper is organized as follows. In Sections 2 and 3 we formulate and solve 
the extremal problem of interest. The solution is obtained by reducing the extremal 
problem to a non-standard boundary-value problem and constructing a Green function
for the latter. This part of the paper, with the main result given by Theorem 1,
is purely analytical and might be of independent interest for experts in PDEs. The
rest of the paper is largely statistical, with a focus on various applications of Theorem 1
to the problem of testing independence. In particular, Theorem 1 provides
a unified approach to establishing weak convergence for a wide class of empirical 
processes with a multidimensional time parameter which emerge in nonparametric 
statistics (Theorem 2). 

2 Extremal problem 

Let $I^m = [0,1]^m$ and let $C_0^m$ be the space of (real-valued) functions that are $m$-times
continuously differentiable with respect to each variable and obey the following
boundary conditions:

$$C_0^m = \{\Omega \in C^m(I^m) : \Omega(x)|_{x_j = 0} = 0,\ j = 1, \dots, m\},$$

where $x = (x_1, \dots, x_m) \in I^m$. Define a scalar product on $C_0^m$ as follows:

$$(\Omega_1, \Omega_2) = \int_{I^m} \omega_1(x)\,\omega_2(x)\,dx, \quad \Omega_1, \Omega_2 \in C_0^m, \tag{2}$$

where $\omega_i(x) = \dfrac{\partial^m \Omega_i(x)}{\partial x_1 \cdots \partial x_m}$, $i = 1, 2$. Denote by $H^m$ the closure of the space $C_0^m$
under the norm $\|\cdot\|$ induced by the scalar product (2). For any $m \ge 2$, $H^m$ is a
Hilbert space whose properties are derived similarly to the case $m = 2$ studied in
Nazarov and Nikitin (2000). More precisely, the following result holds true.



Proposition 1 (a) The embedding of the space $H^m$ into the space $C(I^m)$ is compact.
(b) The embedding of the space $H^m$ into the Sobolev space $W_2(I^m)$ is compact.

Part (a) of Proposition 1 implies that any function from $H^m$ equals zero on any
"left" side of the cube $I^m$ adjacent to the origin.
Consider the extremal problem 

$$\|\Omega\|_{H^m} \to \min, \tag{3}$$
$$\Omega \in H^m_{\mathcal{M}}, \quad \int_{I^m} \Omega(x)\,d\mu(x) = 1, \tag{4}$$

where $H^m_{\mathcal{M}}$ is a subset of $H^m$ specified by certain boundary conditions on the "right"
sides of the cube $I^m$ adjacent to the point $\mathbf{1} = (1, 1, \dots, 1)$, and $\mu$ is a finite measure
on $I^m$. In order to describe possible boundary conditions of this extremal problem
we need some notation.

Let $M = \{1, 2, \dots, m\}$ and let $2^M$ be the set of all subsets of $M$. For any
$U \subseteq M$, denote by $x_U$ the $|U|$-dimensional vector $x_U = (x_i : i \in U)$, where $|U|$ is the
cardinality of $U$. Consider the set $\mathcal{M} \subseteq 2^M$ such that

$$\forall\, U \subset V \in 2^M, \quad U \in \mathcal{M} \ \text{implies} \ V \in \mathcal{M}. \tag{5}$$

That is, if a set $U$ belongs to $\mathcal{M}$, then all its "oversets" also belong to $\mathcal{M}$. Define
the subset $H^m_{\mathcal{M}}$ of $H^m$ as follows:

$$H^m_{\mathcal{M}} = \{\Omega \in H^m : \Omega(x)|_{x_U = 1} = 0,\ U \in \mathcal{M}\}.$$

The reason for the requirement (5) is simple: if $\Omega \in H^m$ takes a zero value on the
side $S_U = \{x_U = 1\}$, it also takes a zero value on all the subedges of $S_U$ of less
dimension.

For a set $V = (i_1, \dots, i_l)$ and its complement (in $M$) $V^c = (j_1, \dots, j_k)$, $l + k = m$,
put

$$\partial x_V\,\partial x_{V^c}^2 = \partial x_{i_1} \cdots \partial x_{i_l}\,\partial x_{j_1}^2 \cdots \partial x_{j_k}^2.$$

By the Lagrange principle (see, e.g., Alexeev et al.), the necessary
condition of a minimum in (3)-(4) is reduced to the Euler equation (in the sense
of distributions)



$$(-1)^m \lambda\,\frac{\partial^{2m} \Omega(x)}{\partial x_1^2 \cdots \partial x_m^2} = \mu(x), \tag{6}$$



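where $\lambda$ is the Lagrange multiplier, and the natural boundary conditions

$$\frac{\partial^{2m - |V|} \Omega(x)}{\partial x_V\,\partial x_{V^c}^2}\bigg|_{x_V = 1} = 0 \quad \text{for any } V \notin \mathcal{M},\ V \ne \emptyset. \tag{7}$$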
Remark 1 For any $U \in 2^M$ define an $m$-dimensional vector of Boolean variables
$(v_i = I(i \in U),\ i = 1, \dots, m)$. Then $I(U \in \mathcal{M})$ is a monotone Boolean function (see
Kleitman (1969)). Denote by $N(m)$ the total number of such functions. Obviously,
the number of the above considered extremal problems is also equal to $N(m)$. So
far, no explicit formula for $N(m)$ as a function of $m$ has been found. For the
asymptotic behavior of $N(m)$, as $m \to \infty$, see Korshunov (1981).
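For small $m$ the count in Remark 1 can be checked by brute force. The following is only an illustrative sketch: it enumerates every family of subsets of $M$ and keeps those that are closed under taking oversets, as required by (5). The function names are hypothetical, and the count includes the two degenerate families (the empty family and the family of all subsets).

```python
from itertools import combinations

def all_subsets(m):
    """All subsets of M = {1, ..., m}, as frozensets."""
    items = range(1, m + 1)
    return [frozenset(c) for r in range(m + 1) for c in combinations(items, r)]

def is_upward_closed(family, subsets):
    """Property (5): U in the family and U a subset of V imply V in the family."""
    return all(V in family for U in family for V in subsets if U <= V)

def count_upward_closed_families(m):
    """Brute-force count of families of subsets of M that satisfy (5)."""
    subsets = all_subsets(m)
    count = 0
    for mask in range(1 << len(subsets)):
        family = {s for i, s in enumerate(subsets) if (mask >> i) & 1}
        if is_upward_closed(family, subsets):
            count += 1
    return count

if __name__ == "__main__":
    for m in (1, 2, 3):
        print(m, count_upward_closed_families(m))
```

The search space has $2^{2^m}$ families, so this approach is only practical for very small $m$, which is in line with the remark that no explicit formula for $N(m)$ is known.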



3 The Green function of the problem 

Recall that the Green function of the boundary-value problem (4), (6), (7) is the function
$G_{\mathcal{M}}(x, \xi)$ that satisfies, along with the boundary conditions, the equation (in
the sense of distributions)

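$$(-1)^m \frac{\partial^{2m} G_{\mathcal{M}}(x, \xi)}{\partial x_1^2 \cdots \partial x_m^2} = \delta(x - \xi), \tag{8}$$

where $\delta(x)$ is the Dirac function. It is well known that the solution to the problem
(4), (6), (7), and hence the solution to the extremal problem (3)-(4), can be expressed
with the aid of the Green function by the formula

$$\Omega(x) = \lambda^{-1} \int_{I^m} G_{\mathcal{M}}(x, \xi)\,d\mu(\xi), \quad x \in I^m, \tag{9}$$

where the Lagrange multiplier $\lambda$ is found from the integral restriction in (4) and
has the form

$$\lambda = \iint G_{\mathcal{M}}(x, \xi)\,d\mu(x)\,d\mu(\xi). \tag{10}$$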

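For $x, \xi \in I^m$ and for $V \subseteq M$ as before, define the functions

$$k_V(x, \xi) = k_{i_1}(x, \xi) \cdots k_{i_l}(x, \xi), \qquad K_{V^c}(x, \xi) = K_{j_1}(x, \xi) \cdots K_{j_k}(x, \xi),$$

where

$$k_j(x, \xi) = x_j \xi_j, \qquad K_j(x, \xi) = \min(x_j, \xi_j), \qquad j = 1, \dots, m. \tag{11}$$

The first main result of the paper is stated as follows.

Theorem 1 The Green function of the boundary-value problem (4), (6), (7) is

$$G_{\mathcal{M}}(x, \xi) = K_M(x, \xi) - \sum_{U \in \mathcal{M}} a_U\,K_{U^c}(x, \xi)\,k_U(x, \xi), \tag{12}$$

with the coefficients $a_U$ defined recurrently by

$$\sum_{V \subseteq U,\ V \in \mathcal{M}} a_V = 1 \quad \text{for all } U \in \mathcal{M}. \tag{13}$$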
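Proof First, note that for any nonempty $U \subseteq M$

$$(-1)^m \frac{\partial^{2m} K_M(x, \xi)}{\partial x_1^2 \cdots \partial x_m^2} = \delta(x - \xi), \qquad \frac{\partial^{2m} \big(K_{U^c}(x, \xi)\,k_U(x, \xi)\big)}{\partial x_1^2 \cdots \partial x_m^2} = 0.$$

Therefore the function $G_{\mathcal{M}}$ satisfies (8) with an arbitrary choice of the constants
$a_U$.

Further, by (5) for any nonempty set $V = \{i_1, \dots, i_l\} \notin \mathcal{M}$ and for any $U \in \mathcal{M}$
there exists $j \in U \setminus V$, and hence the derivative below contains the factor
$\partial^2 k_j / \partial x_j^2 = \partial^2 (x_j \xi_j) / \partial x_j^2 = 0$, so that

$$\frac{\partial^{2m - |V|} \big(K_{U^c}(x, \xi)\,k_U(x, \xi)\big)}{\partial x_V\,\partial x_{V^c}^2}\bigg|_{x_V = 1} = 0.$$

Thus all summands in (12) satisfy (7).

In order to prove that the function $G_{\mathcal{M}}$ vanishes on $\{x_U = 1\}$ for some $U \in \mathcal{M}$,
we represent $\mathcal{M}$ as the union $\mathcal{M} = \bigcup_{V \supseteq U} \mathfrak{M}_V$ of the disjoint sets

$$\mathfrak{M}_V = \{W \in \mathcal{M} : W \cup U = V\}$$

(in the definition of $\mathfrak{M}_V$ the basic property (5) is used).
The key observation is that for any $W \in \mathfrak{M}_V$,

$$K_{W^c}(x, \xi)\,k_W(x, \xi)\big|_{x_U = 1} = K_{V^c}(x, \xi)\,k_V(x, \xi)\big|_{x_U = 1},$$

because $K_j(x, \xi) = k_j(x, \xi) = \xi_j$ when $x_j = 1$. Therefore formula (12) gives

$$G_{\mathcal{M}}(x, \xi)\big|_{x_U = 1} = K_{U^c}(x, \xi)\,k_U(x, \xi)\big|_{x_U = 1} \cdot \Big(1 - \sum_{W \in \mathfrak{M}_U} a_W\Big) - \sum_{V \supsetneq U} K_{V^c}(x, \xi)\,k_V(x, \xi)\big|_{x_U = 1} \cdot \sum_{W \in \mathfrak{M}_V} a_W.$$

Due to (13) the coefficients on the right-hand side vanish, and $G_{\mathcal{M}}(x, \xi)|_{x_U = 1} = 0$.
This completes the proof. □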
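The recursion for the coefficients and the boundary behaviour of the Green function can be illustrated numerically. The sketch below assumes that Theorem 1 reads $G_{\mathcal{M}} = K_M - \sum_{U \in \mathcal{M}} a_U K_{U^c} k_U$ with $\sum_{V \subseteq U,\, V \in \mathcal{M}} a_V = 1$ for every $U \in \mathcal{M}$; the function names `coefficients` and `green` are hypothetical, and exact rational arithmetic is used only to make the checks free of rounding error.

```python
from fractions import Fraction
from math import prod

def coefficients(family):
    """Solve the recursion: for each U in the family, the a_V over V in the
    family with V a subset of U (U itself included) sum to 1."""
    a = {}
    # Process smaller sets first so that all proper subsets are already known.
    for U in sorted(family, key=len):
        a[U] = 1 - sum(a[V] for V in family if V < U)
    return a

def green(family, m, x, xi):
    """Evaluate K_M - sum_U a_U K_{U^c} k_U at points x, xi of [0,1]^m.
    Coordinates are indexed 1..m to match the sets in the family."""
    K = {j: min(x[j - 1], xi[j - 1]) for j in range(1, m + 1)}
    k = {j: x[j - 1] * xi[j - 1] for j in range(1, m + 1)}
    M = frozenset(range(1, m + 1))
    total = prod(K.values())
    for U, a_U in coefficients(family).items():
        total -= a_U * prod(K[j] for j in M - U) * prod(k[j] for j in U)
    return total

if __name__ == "__main__":
    m = 3
    M = frozenset(range(1, m + 1))
    # Family made of M and all (m-1)-element subsets of M.
    family = {M} | {M - {j} for j in M}
    x = (Fraction(3, 10), 1, 1)          # a point with x_U = 1 for U = {2, 3}
    xi = (Fraction(1, 2), Fraction(1, 4), Fraction(3, 4))
    print(coefficients(family)[M], green(family, m, x, xi))
```

Sorting the sets by cardinality is a design choice that makes the recursion explicit: each $a_U$ is obtained from coefficients of strictly smaller sets in the family.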



Remark 2 Return to the extremal problem (3)-(4) and consider the following
three sets of boundary conditions: (a) there are no restrictions on $\Omega \in H^m$
except for those that specify the space $H^m$, (b) $\Omega \in H^m$ equals zero on any
$(m-1)$-dimensional side of $I^m$, and (c) $\Omega \in H^m$ equals zero at the point $\mathbf{1} =
(1, \dots, 1)$. Then $\mathcal{M} = \emptyset$, $\mathcal{M} = 2^M$, $\mathcal{M} = \{M\}$, respectively, and by Theorem 1
the corresponding Green functions are $\prod_{j=1}^m K_j(x, \xi)$, $\prod_{j=1}^m (K_j(x, \xi) - k_j(x, \xi))$,
$\prod_{j=1}^m K_j(x, \xi) - \prod_{j=1}^m k_j(x, \xi)$. These are covariance functions of the classical Gaussian
random processes. They correspond to an $m$-dimensional Brownian sheet, an
$m$-dimensional Brownian pillow, and an $m$-dimensional tucked Brownian sheet, respectively.
The latter two arise as limiting processes in nonparametric testing of
independence (see Section 4.1 for details).
4 Connection to testing independence



This section illustrates the use of Theorem 1 for some efficiency issues that emerge 
in the problem of testing multivariate independence. We use a general dependence 
model which is rather popular in the present context. 

Let $X_i = (X_{i1}, \dots, X_{im})$, $m \ge 2$, $i = 1, \dots, n$, be independent random vectors
with absolutely continuous df $F$ and marginal df's $F_1, \dots, F_m$. Consider the
problem of testing the hypothesis of independence (1).

For the absolutely continuous df $F$, a copula of $F$ is a (unique) df $C$ with uniform
univariate margins, such that $C(F_1(x_1), \dots, F_m(x_m)) = F(x_1, \dots, x_m)$. When
testing for independence using distribution-free (independent of $F_1, \dots, F_m$) test
statistics, one can assume uniform margins and define the model in terms of copulas.

Let $\{F_\theta : \theta \ge 0\}$ be the family of absolutely continuous copulas $F_\theta$ that are
monotone in $\theta$ (the case $\theta = 0$ corresponds to independence) and that satisfy the
following common regularity conditions, cf. Genest et al. (2007); Quessy (2009):

(A1) the density $f_\theta(x) = \partial^m F_\theta(x) / \partial x_1 \cdots \partial x_m$ admits a square-integrable derivative
$\dot f_0(x)$ of $f_\theta(x)$ with respect to $\theta$ at $\theta = 0$ for every $x = (x_1, \dots, x_m) \in I^m$,
and the function $\sqrt{f_\theta(x)}$ is differentiable in quadratic mean at $\theta = 0$, i.e.,

$$\int_{I^m} \Big(\sqrt{f_\theta(x)} - 1 - \tfrac{1}{2}\,\theta\,\dot f_0(x)\Big)^2 dx = o(\theta^2), \quad \theta \to 0;$$

(A2) $F_\theta(x)$ is differentiable with respect to $\theta$ in a small neighborhood of $\theta = 0$ for
every $x \in I^m$, and the following identity holds for every $x \in I^m$:

$$\dot F_0(x) = \lim_{\theta \to 0} \frac{\partial}{\partial \theta} F_\theta(x) = \int_0^{x_1} \cdots \int_0^{x_m} \dot f_0(y_1, \dots, y_m)\,dy_1 \cdots dy_m;$$

and, in addition,

$$\int_{I^m} \dot F_0(x)\,dx = \lim_{\theta \to 0} \frac{\partial}{\partial \theta} \int_{I^m} F_\theta(x)\,dx.$$

The function $\dot F_0(x)$ is sometimes called the dependence function. Assumption
(A2) implies that $F_\theta(x)$ can be written in terms of $\dot F_0$ as follows:

$$F_\theta(x) = \prod_{i=1}^m x_i + \theta\,\dot F_0(x) + o(\theta), \quad x \in I^m, \quad \theta \to 0,$$

where, by the property of a multivariate copula, the boundary conditions

$$\dot F_0(x)|_{x_j = 0} = 0, \qquad \dot F_0(x)|_{x_U = 1} = 0 \quad \text{for any } U \in 2^M,\ |U| = m - 1, \tag{14}$$

are satisfied. As an alternative to $H_0 : \theta = 0$ we consider the hypothesis $H_1 : \theta > 0$.
In what follows, the underlying df is assumed to belong to the family $\{F_\theta : \theta \ge 0\}$.
The family $\{F_\theta : \theta \ge 0\}$ produces a sequence of locally asymptotically normal
experiments. Indeed, let $P_\theta$ be the probability distribution used to calculate the
df $F_\theta$. Then the full observation is a single observation from the product $P_\theta^n$ of $n$
copies of $P_\theta$. In view of condition (A1), the sequence of statistical experiments
$\{P^n_{h/\sqrt{n}} : h \ge 0\}$, indexed by a local parameter $h = \sqrt{n}\,\theta$, is locally asymptotically
normal (LAN) at the point $h = 0$, that is,

$$\log \frac{dP^n_{h/\sqrt{n}}}{dP^n_0}(X_1, \dots, X_n) = h \Delta_{n,0} - \tfrac{1}{2} h^2 I_0 + o_{P_0^n}(1), \quad n \to \infty,$$

where $I_\theta = \int_{I^m} \dot f_\theta^2(x) / f_\theta(x)\,dx$ is the Fisher information in the family $\{F_\theta, \theta \ge 0\}$
and $\Delta_{n,0} = n^{-1/2} \sum_{i=1}^n \dot f_0(X_i) \rightsquigarrow N(0, I_0)$. The symbol $\rightsquigarrow$ denotes convergence in
$P_0^n$-distribution. The LAN property ensures the mutual contiguity of the sequences
of distributions $\{P^n_{h/\sqrt{n}}\}$ and $\{P^n_0\}$, and facilitates the calculation of Pitman efficiency
of asymptotically normal test statistics (see (van der Vaart, 1998, Ths. 14.7,
15.4)).

4.1 Asymptotic efficiency of independence tests 

Consider testing the hypothesis of independence using distribution-free statistics of
$X_1, \dots, X_n$ in the case when some of the $F_j$'s, $j = 1, \dots, m$, are known, while the
others are unknown. Denote by $\mathbb{F}_n$ the multivariate empirical df that corresponds
to $F$, and denote by $\mathbb{F}_{j,n}$ the marginal empirical df based on $X_{1j}, \dots, X_{nj}$, $j =
1, \dots, m$. For a set $V = (i_1, \dots, i_l)$ and its complement (in $M$) $V^c = (j_1, \dots, j_k)$,
$l + k = m$, put

$$F_V(x) = F_{i_1}(x_{i_1}) \cdots F_{i_l}(x_{i_l}), \qquad dF_V(x) = dF_{i_1}(x_{i_1}) \cdots dF_{i_l}(x_{i_l}),$$

$$\mathbb{F}_{V^c,n}(x) = \mathbb{F}_{j_1,n}(x_{j_1}) \cdots \mathbb{F}_{j_k,n}(x_{j_k}), \qquad d\mathbb{F}_{V^c,n}(x) = d\mathbb{F}_{j_1,n}(x_{j_1}) \cdots d\mathbb{F}_{j_k,n}(x_{j_k}).$$

For $p = 1, 2, \dots$, consider the class $\{B^p_{V,n} : V \subseteq M\}$ of test statistics, cf. Dugué
(1975),

$$B^p_{V,n} = \int \big(\mathbb{F}_n(x) - F_V(x)\,\mathbb{F}_{V^c,n}(x)\big)^p\,dF_V(x)\,d\mathbb{F}_{V^c,n}(x).$$

Suppose that the margins $F_j$, $j \in V$, are known. Then the test statistics $B^p_{V,n}$,
$V \subseteq M$, $p = 1, 2, \dots$, are distribution-free under the null hypothesis, and the study
of their behaviour under $H_0$ can be done when $F(x)$ is a uniform distribution on
$I^m$, which will be assumed from now on. Choosing $V = M$ yields the Cramér-von
Mises-type statistics

$$B^p_{M,n} = \int_{I^m} \big(\mathbb{F}_n(x) - x_M\big)^p\,dx_M,$$

where $x_M = x_1 \cdots x_m$ and $dx_M = dx = dx_1 \cdots dx_m$. On the other hand, setting
$V = \emptyset$ leads to the Blum-Kiefer-Rosenblatt-type statistics

$$B^p_{\emptyset,n} = \int_{I^m} \big(\mathbb{F}_n(x) - \mathbb{F}_{M,n}(x)\big)^p\,d\mathbb{F}_{M,n}(x).$$
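For a small sample the Blum-Kiefer-Rosenblatt-type statistic can be evaluated directly, since the product of the marginal empirical df's puts mass $n^{-m}$ on each point of the grid formed by the observed coordinates. The sketch below is a naive illustration under the assumption that there are no ties within any coordinate; the function name `bkr_statistic` is hypothetical, and the cost grows like $n^{m+1}$.

```python
from itertools import product

def bkr_statistic(sample, p=1):
    """Integral of (F_n - product of marginal empirical df's)^p with respect to
    the product of the marginal empirical df's. Assumes no ties per coordinate."""
    n = len(sample)
    m = len(sample[0])
    # Sorted values of each coordinate: the atoms of the marginal empirical df's.
    columns = [sorted(row[j] for row in sample) for j in range(m)]
    total = 0.0
    for idx in product(range(n), repeat=m):
        point = [columns[j][idx[j]] for j in range(m)]
        # Joint empirical df at the grid point.
        joint = sum(all(row[j] <= point[j] for j in range(m)) for row in sample) / n
        # The j-th marginal empirical df at its (i+1)-th order statistic is (i+1)/n.
        marginal = 1.0
        for j in range(m):
            marginal *= (idx[j] + 1) / n
        total += (joint - marginal) ** p
    return total / n ** m

if __name__ == "__main__":
    print(bkr_statistic([(1, 1), (2, 2)], p=1))
    print(bkr_statistic([(1, 2), (2, 1)], p=1))
```

The value depends on the data only through the ordering within each coordinate, which reflects the distribution-free property mentioned above.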
An important step in calculating asymptotic efficiency of the sequence of test
statistics $\{B^p_{V,n}\}_{n \ge 1}$ lies in showing the weak convergence of

$$W_{V,n}(x) = \sqrt{n}\,\big(\mathbb{F}_n(x) - x_V\,\mathbb{F}_{V^c,n}(x)\big), \quad x \in I^m,$$

to a limiting Gaussian process under the null hypothesis. This is achieved with
the aid of Theorem 1, and is stated below as Theorem 2. The Skorokhod space
$D(I^m)$ that appears in the statement of Theorem 2 and generalizes the well-known
space $D[0, 1]$, is described in Neuhaus (1971), where some of its properties are also
derived.



Theorem 2 Assume that $F(x)$ is a uniform distribution on $I^m$. Then for any
$V \subseteq M$ the empirical process $W_{V,n}(x)$ converges weakly in the Skorokhod space
$D(I^m)$ to a centered Gaussian process $W_V(x)$ with covariance function $G_{\mathcal{M}_V}(x, \xi)$
given by (12), where the set $\mathcal{M}_V \subseteq 2^M$ is defined as follows:

$$\mathcal{M}_V = \{M,\ M \setminus U : U \subseteq V^c,\ |U| = 1\}. \tag{15}$$



Proof The proof follows the pattern of (Neuhaus, 1971, Sec. 4, 5), where, among
others, weak convergence of the process $W_{M,n}(x) = \sqrt{n}\,(\mathbb{F}_n(x) - x_M)$ to a tucked
Brownian sheet is established. Therefore, most details on the convergence of finite-dimensional
distributions of $W_{V,n}(x)$ to a multivariate normal distribution, and
the proof of tightness of the family of associated probability measures are omitted.
A pivotal point of our proof is obtaining an expression for the covariance of the
limiting process. This is accomplished by appealing to Theorem 1.

For some $j \notin V$ let $U = M \setminus \{j\}$. Then $\mathbb{F}_n(x)|_{x_U = 1} = \mathbb{F}_{j,n}(x_j)$, so that the
process $W_{V,n}(x)$ is pinned down to zero on $\{x_U = 1\}$ for any $U \in \mathcal{M}_V$ such that
$V \subseteq U$ and $|U| = m - 1$.

Now consider the boundary-value problem (4), (6), (7) with the set $\mathcal{M} = \mathcal{M}_V$
given by (15). The respective boundary condition takes the form

$$\Omega(x)|_{x_U = 1} = 0 \quad \text{for any } U \in 2^M \text{ such that } V \subseteq U,\ |U| = m - 1.$$

The function $\Omega(x)$ equals zero exactly on those sides $\{x_U = 1\}$ of the cube $I^m$ where
the empirical process $W_{V,n}(x)$ vanishes. This observation together with Theorem 1
gives the required expression for the covariance function. The proof is completed. □



□ 



In the two extreme cases, when (1) $V = M$ and (2) $V = \emptyset$, Theorem 2 yields a
well-known result (see, e.g., Blum et al. (1961); Neuhaus (1971); Deheuvels (1981)).
Indeed, in the first case the set in (15) reduces to $\mathcal{M}_M = \{M\}$ and the covariance
function of $W_M(x)$ is

$$G_{\mathcal{M}_M}(x, \xi) = \prod_{i=1}^m K_i(x, \xi) - \prod_{i=1}^m k_i(x, \xi). \tag{16}$$

The respective boundary condition has the form $\Omega \in H^m$, $\Omega(x)|_{x_M = 1} = 0$. In the
second case (15) becomes

$$\mathcal{M}_\emptyset = \{M,\ M \setminus \{1\},\ M \setminus \{2\},\ \dots,\ M \setminus \{m\}\},$$

and the covariance function of $W_\emptyset(x)$ is

$$G_{\mathcal{M}_\emptyset}(x, \xi) = \prod_{i=1}^m K_i(x, \xi) - \sum_{j=1}^m K_j(x, \xi) \prod_{i \ne j} k_i(x, \xi) + (m - 1) \prod_{i=1}^m k_i(x, \xi). \tag{17}$$

The respective boundary condition takes the form $\Omega \in H^m$, $\Omega(x)|_{x_U = 1} = 0$ for any
$U \in 2^M$ such that $|U| = m - 1$.
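The double integral of the covariance function of $W_\emptyset$ over $I^m \times I^m$ has a simple closed form, which is where the constant $(4/3)^m - m/3 - 1$ used below comes from. The following sketch assumes that the covariance has the form obtained from Theorem 1 for $\mathcal{M}_\emptyset$, namely $\prod_i K_i - \sum_j K_j \prod_{i \ne j} k_i + (m-1) \prod_i k_i$, and integrates it term by term using $\iint \min(s, t)\,ds\,dt = 1/3$ and $\iint s t\,ds\,dt = 1/4$ on the unit square. The function names are hypothetical.

```python
from fractions import Fraction

def double_integral_termwise(m):
    """Integrate prod K_i - sum_j K_j prod_{i != j} k_i + (m-1) prod k_i
    over I^m x I^m, one coordinate at a time."""
    third, quarter = Fraction(1, 3), Fraction(1, 4)
    return third**m - m * third * quarter**(m - 1) + (m - 1) * quarter**m

def double_integral_closed_form(m):
    """The same quantity written as ((4/3)^m - m/3 - 1) / 4^m."""
    return (Fraction(4, 3)**m - Fraction(m, 3) - 1) / 4**m

if __name__ == "__main__":
    for m in range(2, 6):
        print(m, double_integral_termwise(m), double_integral_closed_form(m))
```

For $m = 2$ both expressions equal $1/144 = 12^{-2}$, in agreement with the fact that the two covariance functions (Brownian pillow and $W_\emptyset$) coincide in the bivariate case.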

For large sample sizes, the quality of test statistics $B^p_{V,n}$, $V \subseteq M$, $p \ge 1$,
can be judged by looking at their Bahadur efficiency. This kind of asymptotic
efficiency is quantified by the Bahadur exact slope. Finding the Bahadur exact
slope of a sequence of test statistics requires the law of large numbers under the
alternative, and the rough large deviation asymptotics under the null hypothesis.
The problem of large deviation asymptotics consists in minimizing the Kullback-Leibler
information over a subset of distribution functions that depends on the
structure of the test statistic. When $p = 1$ the above minimization problem is
reduced, by using variational methods, to the boundary-value problem (4), (6), (7)
with $\mu$ being the Lebesgue measure on $I^m$ and $\mathcal{M} = \mathcal{M}_V$ given by (15), whose
solution provides the main contribution to the initial problem (see (Nikitin, 1995,
Ch. 5) for details).

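For example, finding the rough large deviation asymptotics of the statistic

$$B^1_{\emptyset,n} = \int_{I^m} \big(\mathbb{F}_n(x) - \mathbb{F}_{M,n}(x)\big)\,d\mathbb{F}_{M,n}(x)$$

is largely reduced to the boundary-value problem (4), (6), (7) with $\mathcal{M} = \mathcal{M}_\emptyset$ and
for sufficiently small $t > 0$, cf. formula (5.3.29) of Nikitin (1995),

$$\lim_{n \to \infty} n^{-1} \log P_{H_0}\big(B^1_{\emptyset,n} \ge t\big) = -b_0 t^2 + \sum_{j \ge 3} c_j t^j, \tag{18}$$

where the series on the right-hand side is convergent and, cf. (9) and (10),

$$b_0 = \frac{1}{2} \left(\iint G_{\mathcal{M}_\emptyset}(x, \xi)\,dx\,d\xi\right)^{-1} = \frac{4^m}{2\big((4/3)^m - m/3 - 1\big)}$$

(see (Nikitin, 1995, Sec. 5.3)). Since, under the alternative,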



-^0,n ^ / -^o(x) (ix, n — > cx), 
J/™ 
where the symbol $\xrightarrow{P_\theta}$ denotes convergence in $P_\theta$-probability, it follows from (18)
and Theorem 1.2.2 of Nikitin (1995) that the Bahadur exact slope $c_{B^1_\emptyset}(\theta)$ of the
sequence $\{B^1_{\emptyset,n}\}_{n \ge 1}$ satisfies as $\theta \to 0$

$$c_{B^1_\emptyset}(\theta) \sim \frac{4^m}{(4/3)^m - m/3 - 1} \left(\int_{I^m} \dot F_0(x)\,dx\right)^2 \theta^2. \tag{19}$$


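Similarly, in a general case, for the model under consideration a routine computation
leads to the following result.

Proposition 2 For an arbitrary $V \subseteq M$ the Bahadur exact slope of the sequence
$\{B^1_{V,n}\}_{n \ge 1}$ satisfies

$$c_{B^1_V}(\theta) \sim \left(\iint G_{\mathcal{M}_V}(x, \xi)\,dx\,d\xi\right)^{-1} \left(\int_{I^m} \dot F_0(x)\,dx\right)^2 \theta^2, \quad \theta \to 0,$$

where the function $G_{\mathcal{M}_V}(x, \xi)$ is the same as in Theorem 2.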

Compared to the case $p = 1$, the evaluation of the local Bahadur efficiency of the tests
based on $B^p_{V,n}$, $p \ge 2$, is much more complicated. For example, when $p = 2$, the
efficiency problem is reduced to calculating the principal eigenvalue of the integral
operator with kernel $G_{\mathcal{M}_V}$ (see (Nikitin, 1995, Ch. 5) for details).



A more complex empirical process that vanishes completely at the boundary
of $I^m$ was studied, for example, in Neuhaus (1971) and Deheuvels (2005). Such a
process converges weakly to the $m$-dimensional Brownian pillow with covariance
function

$$G_{2^M}(x, \xi) = \prod_{j=1}^m \big(K_j(x, \xi) - k_j(x, \xi)\big), \quad x, \xi \in I^m, \tag{20}$$

which corresponds to the Green function (12) with $\mathcal{M} = 2^M$ (see Remark 2),
and appears in connection with testing multivariate independence in the following
context.

Consider $W_{\emptyset,n}(x) = \sqrt{n}\,\big(\mathbb{F}_n(x) - \prod_{j=1}^m \mathbb{F}_{j,n}(x_j)\big)$. The corresponding limiting
process $W_\emptyset(x)$ has the covariance function $G_{\mathcal{M}_\emptyset}(x, \xi)$ which coincides with $G_{2^M}(x, \xi)$
when $m = 2$. However, for $m \ge 3$ the situation changes. The process $W_\emptyset(x)$ does
not vanish completely at the facets of $I^m$ adjacent to the point $x = \mathbf{1}$, but only
does so at the one-dimensional edges (see (17)). This "disappointing" property is
overcome by the tied-down empirical process (see, e.g., Neuhaus (1971); Deheuvels
(2005))


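$$\widetilde{W}_{\emptyset,n}(x) = \sqrt{n}\,\Big(\mathbb{F}_n(x) - \sum_{k=1}^m (-1)^{k-1} \sum_{U \subseteq M : |U| = k} x_U \cdot \mathbb{F}_n(x)|_{x_U = 1}\Big), \quad x \in I^m,$$

where $x_U$ inside the sum stands for the product $\prod_{i \in U} x_i$, which can be equivalently
written in the form

$$\widetilde{W}_{\emptyset,n}(x) = \frac{1}{\sqrt{n}} \sum_{i=1}^n \prod_{j=1}^m \big(I(X_{ij} \le x_j) - x_j\big), \quad x \in I^m.$$

Under the null hypothesis, $\widetilde{W}_{\emptyset,n}(x)$ converges weakly in the Skorokhod space $D(I^m)$
toward an $m$-dimensional Brownian pillow $\widetilde{W}_\emptyset(x)$ with covariance function (20).
This fact seems to be established for the first time in Neuhaus (1971). The corresponding
test statistics (for $m \ge 3$) take the form, cf. Dugué (1975); Deheuvels,

$$\widetilde{B}^p_{\emptyset,n} = \int_{I^m} \Big(\mathbb{F}_n(x) - \sum_{k=1}^m (-1)^{k-1} \sum_{U \subseteq M : |U| = k} x_U \cdot \mathbb{F}_n(x)|_{x_U = 1}\Big)^p dx.$$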



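For $p = 1$, under the hypothesis of independence, the limiting distribution of
$\sqrt{n}\,\widetilde{B}^1_{\emptyset,n}$ is normal with zero mean and variance, cf. Th. 6 of Deheuvels (1981),

$$\sigma_m^2(0) = \iint G_{2^M}(x, \xi)\,dx\,d\xi = 12^{-m}.$$

Therefore, by Le Cam's third lemma, for all $h \ge 0$ and $\theta_n = h / \sqrt{n}$,

$$\frac{\sqrt{n}\,\big(\widetilde{B}^1_{\emptyset,n} - \mu_m(\theta_n)\big)}{\sigma_m(0)} \rightsquigarrow N(0, 1), \quad n \to \infty,$$

where, using (14),

$$\mu_m(\theta) = \theta \int_{I^m} \Big(\dot F_0(x) - \sum_{k=1}^{m-2} (-1)^{k-1} \sum_{U \subseteq M : |U| = k} x_U \cdot \dot F_0(x)|_{x_U = 1}\Big)\,dx.$$

The symbol $\rightsquigarrow$ denotes convergence in $P^n_{\theta_n}$-distribution of a random sample drawn
from $F_{\theta_n}$. In view of Theorem 14.7 in van der Vaart (1998), the squared Pitman
slope of the sequence $\{\widetilde{B}^1_{\emptyset,n}\}_{n \ge 1}$ is

$$\left(\frac{\mu_m'(0)}{\sigma_m(0)}\right)^2 = 12^m \left(\int_{I^m} \Big(\dot F_0(x) - \sum_{k=1}^{m-2} (-1)^{k-1} \sum_{U \subseteq M : |U| = k} x_U \cdot \dot F_0(x)|_{x_U = 1}\Big)\,dx\right)^2.$$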



Remark 3 Similar to $\widetilde{W}_{\emptyset,n}(x)$ one can get other empirical processes with a limiting
covariance of the form (12) by subtracting from $\mathbb{F}_n$ linear combinations of empirical
processes of less dimension.
4.2 Local asymptotic optimality of independence tests



An interesting statistical problem that leads to the extremal problem (3)-(4) is
that of local asymptotic optimality of independence tests.

Consider testing the hypothesis of independence $H_0 : \theta = 0$ versus the alternative
$H_1 : \theta > 0$. Two commonly used measures for judging the quality of testing are
the Pitman slope and the Bahadur local index. Under both approaches, the measure
of efficiency of a given test statistic $T_n = T(X_1, \dots, X_n)$ has an upper bound
(see, e.g., (van der Vaart, 1998, Th. 15.4) and Bahadur (1971)), which yields the
inequality

$$b_T(\dot F_0) \le \int_{I^m} \dot f_0^2(x)\,dx. \tag{21}$$

Here $b_T$ is a homogeneous functional of degree 2 defined on the space $H^m$ that
depends on a structure of $T_n$ and measures efficiency of the corresponding test.
For the Bahadur efficiency the upper bound (21) is a local version of the Bahadur-Raghavachari
inequality (see Bahadur (1971)). For the test based on $T_n$, the closer
$b_T(\dot F_0)$ is to $\int_{I^m} \dot f_0^2(x)\,dx$ the better the family $\{F_\theta : \theta \ge 0\}$ is. Thus, in order to
describe the domain of Bahadur and/or Pitman optimality of the sequence of test
statistics $\{T_n\}$, we need to know for which dependence function $\dot F_0$ equality in (21)
is attained. If $b_T$ is the square of a linear functional, this leads to extremal problem
(3)-(4). For $m = 2$ some applications related to establishing Bahadur optimality of
independence tests can be found in Nazarov and Nikitin (2000). Here we cite two
examples from Nazarov and Nikitin (2000) with non-Lebesgue measure $\mu(x)$ in the
problem (3)-(4) that corresponds to the integration over diagonal(s) of $I^2$. These
examples are connected to testing independence using the Gini rank statistic and
Spearman's footrule.

Let $X_i = (X_{i1}, \dots, X_{im})$, $m \ge 2$, $i = 1, \dots, n$, be as before. Denote by $R_{ij}$ the
rank of $X_{ij}$ among $X_{1j}, \dots, X_{nj}$, $i = 1, \dots, n$, $j = 1, \dots, m$. Recall that the Gini
rank coefficient is defined for $m = 2$ by

$$r_G = \frac{2}{D_n} \sum_{i=1}^n \big(|n + 1 - R_{i1} - R_{i2}| - |R_{i1} - R_{i2}|\big),$$

where $D_n = n^2$ if $n$ is even and $D_n = n^2 - 1$ if $n$ is odd, and inequality (21) takes
the form (see Nazarov and Nikitin (2000))

$$24 \left(\int_0^1 \big(\dot F_0(x, x) + \dot F_0(1 - x, x)\big)\,dx\right)^2 \le \int_{I^2} \dot f_0^2(x)\,dx. \tag{22}$$

In this case $\mu(x) = \delta(x_1 - x_2) + \delta(1 - x_1 - x_2)$, $x = (x_1, x_2) \in I^2$, and equality in
(21) is attained for the function, cf. Theorem 1,

$$\dot F_0(x) = C \int_{I^2} G_{\mathcal{M}_\emptyset}(x, \xi)\,d\mu(\xi), \quad C > 0, \tag{23}$$

where, in view of (14), $G_{\mathcal{M}_\emptyset}(x, \xi)$ is given by (17). Integrating in (23) yields

$$\dot F_0(x) = C \big(|x_1 - x_2|^3 + |x_1 + x_2 - 1|^3 - 3(x_1^2 + x_2^2) + 3(x_1 + x_2) - 1\big), \quad C > 0.$$

For the Spearman footrule based on the statistic $r_F = \sum_{i=1}^n |R_{i1} - R_{i2}|$ the local
Bahadur index on the left-hand side of (21) equals

$$90 \left(\int_0^1 \dot F_0(x, x)\,dx\right)^2,$$

which corresponds to the measure $\mu(x) = \delta(x_1 - x_2)$, $x = (x_1, x_2) \in I^2$. Therefore
the optimal dependence function has the form

$$\dot F_0(x) = C \int_{I^2} G_{\mathcal{M}_\emptyset}(x, \xi)\,d\mu(\xi) = C \big(|x_1 - x_2|^3 - (x_1 + x_2)^3 + 2 x_1 x_2 (x_1^2 + x_2^2 + 2)\big), \quad C > 0.$$



Another interesting application, when $m \ge 2$, is connected to Pitman optimality
of a multivariate Spearman's rho, cf. Schmid and Schmidt (2007); Quessy (2009):



I . - , . 



i=l k=\ 



+ 1 



where $C_m = n^{-1} \sum_{i=1}^n i^m - ((n + 1)/2)^m$ is a normalizing factor. The statistic $S_{m,n}$
is a sample counterpart of the functional

$$S_m(F) = \frac{2^m (m + 1)}{2^m - m - 1} \left(\int F\,dF_1 \cdots dF_m - 2^{-m}\right).$$



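As before, consider the family $\{F_\theta : \theta \ge 0\}$ of absolutely continuous copulas
that satisfy (A1) and (A2). Under the Pitman approach, $\theta = \theta_n = h / \sqrt{n}$, where
$h \ge 0$ is a local parameter, and for all $h \ge 0$, cf. (Quessy, 2009, Cor. 1),

$$\frac{\sqrt{n}\,\big(S_{m,n} - \mu_m(\theta_n)\big)}{\sigma_m(0)} \rightsquigarrow N(0, 1), \quad n \to \infty,$$

where $\mu_m(\theta) = \theta\,\dfrac{2^m (m + 1)}{2^m - m - 1} \displaystyle\int_{I^m} \dot F_0(x)\,dx$ and $\sigma_m^2(0) = \dfrac{(m + 1)^2 \big((4/3)^m - m/3 - 1\big)}{(2^m - m - 1)^2}$. By
Theorem 15.4 in van der Vaart (1998), inequality (21) takes the form

$$\frac{4^m}{(4/3)^m - m/3 - 1} \left(\int_{I^m} \dot F_0(x)\,dx\right)^2 \le \int_{I^m} \dot f_0^2(x)\,dx. \tag{24}$$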



Then, the application of Theorem 1 yields that the sequence of test statistics
$\{S_{m,n}\}_{n \ge 1}$ is Pitman optimal if and only if, cf. (Stepanova, 2003, Ths. 2, 3)
and (Quessy, 2009, Sec. 4.4),

$$\dot F_0(x) = C \prod_{j=1}^m x_j \left(\prod_{j=1}^m (2 - x_j) + \sum_{j=1}^m x_j - (m + 1)\right), \quad x \in I^m, \quad C > 0. \tag{25}$$
Indeed, the test based on $S_{m,n}$ is the "best" for those dependence functions $\dot F_0$ that
deliver equality in inequality (24). Thus, taking into account (14), we minimize
the functional $\int_{I^m} \dot f_0^2(x)\,dx$ on the space $H^m$ subject to

$$\int_{I^m} \dot F_0(x)\,dx = 1, \qquad \dot F_0(x)|_{x_U = 1} = 0 \quad \text{for any } U \in 2^M,\ |U| = m - 1.$$

By the results of Sections 2 and 3, including Theorem 1, the functional $\int_{I^m} \dot f_0^2(x)\,dx$
is minimized when

$$\dot F_0(x) = \lambda^{-1} \int_{I^m} G_{\mathcal{M}_\emptyset}(x, \xi)\,d\xi, \tag{26}$$

where $G_{\mathcal{M}_\emptyset}$ is given by (17). By homogeneity of inequality (24) the extremal function
is defined up to a positive constant. Integrating in (26) yields (25).
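The integration step can be checked with exact arithmetic. The sketch below assumes that the covariance in (26) has the form $\prod_i K_i - \sum_j K_j \prod_{i \ne j} k_i + (m - 1) \prod_i k_i$ and that (25) reads $C \prod_j x_j \big(\prod_j (2 - x_j) + \sum_j x_j - (m + 1)\big)$; it integrates over $\xi$ term by term using $\int_0^1 \min(t, s)\,ds = t - t^2/2$ and $\int_0^1 t s\,ds = t/2$. The function names are hypothetical.

```python
from fractions import Fraction
from math import prod

def integral_of_green(x):
    """Integral over xi in I^m of
    prod K_i - sum_j K_j prod_{i != j} k_i + (m-1) prod k_i, term by term."""
    m = len(x)
    K = [t - t * t / 2 for t in x]   # integral of min(t, s) over s in [0, 1]
    k = [t / 2 for t in x]           # integral of t * s over s in [0, 1]
    cross = sum(K[j] * prod(k[i] for i in range(m) if i != j) for j in range(m))
    return prod(K) - cross + (m - 1) * prod(k)

def closed_form(x):
    """The bracketed expression of (25) with C = 1."""
    m = len(x)
    return prod(x) * (prod(2 - t for t in x) + sum(x) - (m + 1))

if __name__ == "__main__":
    x = (Fraction(1, 3), Fraction(1, 2), Fraction(3, 4))
    # The two expressions differ by the constant factor 2^m.
    print(integral_of_green(x) * 2 ** len(x), closed_form(x))
```

Under these assumptions the two expressions agree up to the factor $2^m$, which is absorbed into the constant $C$, and the closed form vanishes whenever all but one of the coordinates equal 1, as required by (14).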

Remark 4 For $m = 2$ the test statistics $B^1_{\emptyset,n}$ and $S_{m,n}$ are known to be asymptotically
equivalent (see, e.g., (Nikitin, 1995, Ch. 5)). The results of this section
extend this property to all $m > 2$. Indeed, due to (19) and (24), the square of the
Pitman slope of $S_{m,n}$ equals the Bahadur local index of $B^1_{\emptyset,n}$. Thus, the respective
left-hand sides in (21) coincide, and the tests based on $B^1_{\emptyset,n}$ and $S_{m,n}$, $m \ge 2$, are
asymptotically equivalent.